Three-stage quality control strategies for DNA re-sequencing data
نویسندگان
چکیده
Advances in next-generation sequencing (NGS) technologies have greatly improved our ability to detect genomic variants for biomedical research. In particular, NGS technologies have been recently applied with great success to the discovery of mutations associated with the growth of various tumours and in rare Mendelian diseases. The advance in NGS technologies has also created significant challenges in bioinformatics. One of the major challenges is quality control of the sequencing data. In this review, we discuss the proper quality control procedures and parameters for Illumina technology-based human DNA re-sequencing at three different stages of sequencing: raw data, alignment and variant calling. Monitoring quality control metrics at each of the three stages of NGS data provides unique and independent evaluations of data quality from differing perspectives. Properly conducting quality control protocols at all three stages and correctly interpreting the quality control results are crucial to ensure a successful and meaningful study.
منابع مشابه
Strategies and Clinical Applications of Next Generation Sequencing
Abstract DNA sequencing is one of the great valuable techniques in molecular biology, which can be used to detect the sequence of nucleotides in a DNA fragment. The high-throughput sequencing known as Next Generation Sequencing (NGS) revolutionized genomic research and molecular biology; therefore, the whole human genome can be sequenced with a low cost in several days. NGS technology is simi...
متن کاملStrategies and Clinical Applications of Next Generation Sequencing
Abstract DNA sequencing is one of the great valuable techniques in molecular biology, which can be used to detect the sequence of nucleotides in a DNA fragment. The high-throughput sequencing known as Next Generation Sequencing (NGS) revolutionized genomic research and molecular biology; therefore, the whole human genome can be sequenced with a low cost in several days. NGS technology is simi...
متن کاملAlmostSignificant: simplifying quality control of high-throughput sequencing data
MOTIVATION The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control (QC) processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of QC an onerous and time-consuming task that requires multiple p...
متن کاملDetermination of genomic copy number alteration emphasizing a restriction site-based strategy of genome re-sequencing
MOTIVATION Copy number abbreviation (CNA) is one type of genomic aberration that is often induced by genome instability and is associated with diseases such as cancer. Determination of the genome-wide CNA profile is an important step in identifying the underlying mutation mechanisms. Genomic data based on next-generation sequencing technology are particularly suitable for determination of high-...
متن کاملPrediction of job stress in teachers based on forgiveness and thought control strategies and their relationships with stress
Introduction: Today, because of stress and changes in lifestyle, stress has become a common phenomenon and complex. The aim of the present study was to the prediction of job stress in teachers based on forgiveness and thought control strategies and their relationships with stress. Method: The population consisted of all primary school teachers in the academic year 2015-2016 and according to Mor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Briefings in bioinformatics
دوره 15 6 شماره
صفحات -
تاریخ انتشار 2014